首页> 外文OA文献 >A deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes for statistical parametric speech synthesis
【2h】

A deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes for statistical parametric speech synthesis

机译:基于深度自动编码器的FFT谱包络的低维特征提取,用于统计参数语音合成

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In the state-of-the-art statistical parametric speech synthesis system, a speech analysis module, e.g. STRAIGHT spectral analysis, is generally used for obtaining accurate and stable spectral envelopes, and then low-dimensional acoustic features extracted from obtained spectral envelopes are used for training acoustic models. However, a spectral envelope estimation algorithm used in such a speech analysis module includes various processing derived from human knowledge. In this paper, we present our investigation of deep autoencoder based, non-linear, data-driven and unsupervised low-dimensional feature extraction using FFT spectral envelopes for statistical parametric speech synthesis. Experimental results showed that a text-to-speech synthesis system using deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes is indeed a promising approach.
机译:在最新的统计参量语音合成系统中,语音分析模块,例如,语音分析模块,被称为语音分析模块。通常使用STRAIGHT频谱分析来获得准确而稳定的频谱包络,然后将从获得的频谱包络中提取的低维声学特征用于训练声学模型。但是,在这种语音分析模块中使用的频谱包络估计算法包括从人类知识中得出的各种处理。在本文中,我们介绍了对基于深度自动编码器,非线性,数据驱动和无监督的低维特征提取的研究,该提取使用FFT频谱包络进行统计参数语音合成。实验结果表明,使用基于深度自动编码器的低维特征提取(从FFT频谱包络中提取)的文本到语音合成系统确实是一种很有前途的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号